Fp-growth Based New Normalization Technique for Subgraph Ranking

نویسندگان

  • E. R. Naganathan
  • S. Narayanan
  • K. Ramesh kumar
چکیده

The various problems in large volume of data area have been solved using frequent itemset discovery algorithms. As data mining techniques are being introduced and widely applied to non-traditional itemsets, existing approaches for finding frequent itemsets were out of date as they cannot satisfy the requirement of these domains. Hence, an alternate method of modeling the objects in the said data set, is graph. Modeling objects using graphs allows us to represent an arbitrary relation among entities. The graph is used to model the database objects. Within that model, the problem of finding frequent patterns becomes that of finding subgraphs that occur frequently over the entire set of graphs. In this paper, we present an efficient algorithm for ranking of such frequent subgraphs. This proposed ranking method is applied to the FP-growth method for discovering frequent subgraphs. In order to find out the ranking of subgraphs we present a new normalization technique which is the modified normalization technique applied at each position for a chosen value of Discounted Cumulative Gain (DCG) of a subgraph. Instead of DCG another modified approach called Modified Discounted Cumulative Gain (MDCG) is introduced. The MDCG alone cannot be used to achieve the performance from one query to the next in the search engine’s algorithm. To obtain the new normalization technique an ideal ordering of MDCG (IMDCG) at each position is to be found out. A Modified Discounted Cumulative Gain (MDCG) is calculated using “lift” as a new approach. IMDCG is also evaluated. Then the new approach for finding the normalized values are to be computed. Finally, the values for all rules can be averaged to get an average performance of a ranking algorithm. And also the ordering of obtained values as a result at each position will provide the order of evaluation of rules which in turn gives an efficient ranking of mined subgraphs.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

FP-Rank: An Effective Ranking Approach Based on Frequent Pattern Analysis

Ranking documents in terms of their relevance to a given query is fundamental to many real-life applications such as document retrieval and recommendation systems. Extensive studies in this area have focused on developing efficient ranking models. While ranking models are usually trained based on given training datasets, besides model training algorithms, the quality of the document features se...

متن کامل

Improving Human Activity Recognition Through Ranking and Re-ranking

We propose two well-motivated ranking-based methods to enhance the performance of current state-of-the-art human activity recognition systems. First, as an improvement over the classic power normalization method, we propose a parameter-free ranking technique called rank normalization (RaN). RaN normalizes each dimension of the video features to address the sparse and bursty distribution problem...

متن کامل

Presenting a Model for Ranking Insurance Organizations Based on Electronic Readiness Using a Combination of ANP and DEMATEL Techniques

Activities of insurance, like other work fields, have been greatly influences by new technologies; therefore, they must be at the suitable level of electronic readiness to provide more effective information services in line with the needs of new era. But the question is, are the insurance companies in Iran ready to use the e-commerce opportunities efficiently? To answer, it is necessary t...

متن کامل

An Improved Page Rank Algorithm based on Optimized Normalization Technique

Page Ranking is an important component for information retrieval system. It is used to measure the importance and behavior of web pages. We review two approaches for ranking: HITS concept and Page Rank method. Both approaches focus on the link structure of the Web to find the importance of the Web pages. The Page Rank algorithm calculates the rank of individual web page and Hypertext Induced To...

متن کامل

Presenting a Model for Ranking Insurance Organizations Based on Electronic Readiness Using a Combination of ANP and DEMATEL Techniques

Activities of insurance, like other work fields, have been greatly influences by new technologies; therefore, they must be at the suitable level of electronic readiness to provide more effective information services in line with the needs of new era. But the question is, are the insurance companies in Iran ready to use the e-commerce opportunities efficiently? To answer, it is necessary t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011